Back

Research Synthesis Methods

17 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Accuracy and efficiency of using artificial intelligence for data extraction in systematic reviews. A noninferiority study within reviews
2026-02-27 public and global health 10.64898/2026.02.25.26347053
#1 (19.3%)
Show abstract

BackgroundSystematic reviews are important for informing public health policies and program selection; however, they are time- and resource-intensive. Artificial intelligence (AI) offers a solution to reduce these labour-intensive requirements for various aspects of systematic review production, including data extraction. To date, there is limited robust evidence evaluating the accuracy and efficiency of AI for data extraction. This study within a review (SWAR) aimed to determine whether human d...

2
Removing animal and nonhuman records in Ovid Embase: A comparison of 11 filters
2026-02-17 health informatics 10.64898/2026.02.13.26346239
#1 (17.1%)
Show abstract

IntroductionSeveral filters are routinely used to remove animal or nonhuman records in Ovid Embase, despite there being no performance data for them. The filters take different approaches in design. ObjectiveTo understand and compare the impact of 11 filters to remove animal or nonhuman records in Ovid Embase. To understand the indexing of relevant subject headings in Embase. MethodsTo assess filter performance, we screened and categorised 3,000 records as should be removed or should be reta...

3
Systematic reviews in minutes to hours using artificial intelligence
2026-02-10 health informatics 10.64898/2026.02.06.26345764
#1 (14.7%)
Show abstract

Systematic reviews are used in academia, biotechnology, pharmaceutical companies and government to synthesise and appraise large numbers of publications. The current (largely manual) workflow takes an average of 9-18 months1, at a cost of $100,000+ per review2. We built a platform, ScholaraAI, that leverages artificial intelligence to cut this to < 0.1% of the time, without compromising quality. ScholaraAI facilitates end-to-end systematic reviews; search, screening, data extraction, and analysi...

4
Collaborative large language models (LLMs) are all you need for screening in systematic reviews
2026-02-17 health informatics 10.64898/2026.02.07.26345640
#1 (13.7%)
Show abstract

BackgroundThe ability of large language models (LLMs) to work collaboratively and screen studies in a systematic review (SR) is under-explored. Hence, we aimed to evaluate the effectiveness of LLMs in automating the process of screening in systematic reviews. MethodsThis is an observational study which included labeled data (title and abstracts) for five SRs. Originally, two reviewers screened the citations independently for eligibility. A third reviewer cross-checked each citation for quality ...

5
Time-to-retraction and likelihood of evidence contamination (VITALITY Extension I): a retrospective cohort analysis
2026-02-24 epidemiology 10.64898/2026.02.20.26346631
#1 (11.2%)
Show abstract

BackgroundThe number of problematic randomized clinical trials (RCTs) has risen sharply in recent decades, posing serious challenges to the integrity of the healthcare evidence ecosystem. ObjectiveTo investigate whether retraction of problematic RCTs could reduce evidence contamination. DesignRetrospective cohort study SettingA secondary analysis of the VITALITY Study database. Participants1,330 retracted RCTs with 847 systematic reviews. MeasurementsThe difference in the median number (and...

6
An E-value-Informed Sensitivity Analysis Framework for Hybrid Controlled Trials
2026-03-06 epidemiology 10.64898/2026.03.05.26347653
Top 0.3% (3.5%)
Show abstract

Hybrid controlled trials (HCTs) incorporate real-world data into randomized controlled trials (RCTs) by augmenting the internal control arm with patients receiving the same treatment in routine care. Beyond increasing power, HCTs may improve recruitment by supporting unequal randomization ratios that increase patient access to experimental treatments. However, HCT validity is threatened by bias from unmeasured confounding due to lack of randomization of external controls, leading to outcome non-...

7
Evaluating a Locally Deployed 20-Billion Parameter Large Language Model for Automated Abstract Screening in Systematic Reviews
2026-03-04 health informatics 10.64898/2026.03.04.26347506
Top 0.3% (2.1%)
Show abstract

BackgroundSystematic reviews (SRs) are essential for evidence-based medicine but require extensive time and resources for abstract screening. Large language models (LLMs) offer potential for automating this process, yet concerns about data privacy, intellectual property protection, and reproducibility limit the use of cloud-based solutions in research settings. ObjectiveTo evaluate the performance of a locally deployed 20-billion parameter LLM for automated abstract screening in systematic revi...

8
Fully Automated Systematic Review Generation via Large Language Models: Quality Assessment and Implications for Scientific Publishing
2026-02-23 health informatics 10.64898/2026.02.18.26346559
Top 0.3% (2.0%)
Show abstract

Large language models (LLMs) are increasingly transforming scientific workflows, yet their application to rigorous evidence synthesis remains underexplored. Through the execution of a single Python script, we present a fully automated pipeline leveraging the Claude API to generate systematic reviews from literature search through manuscript completion without human intervention. Our pipeline processes hundreds of papers through iterative API calls for inclusion evaluation, information extraction...

9
Does the type of publisher response to integrity concerns influence subsequent citations? A cohort study.
2026-02-27 health informatics 10.64898/2026.02.25.26346683
Top 0.3% (2.0%)
Show abstract

BackgroundJournals may respond to integrity concerns by publishing an editorial response (editorial notice, expression of concern (EoC) or retraction). We investigated whether the type of editorial response affected citation rates. MethodsWe obtained citations for 172 randomised controlled trials (RCTs) with integrity concerns (41 had editorial notices, 38 EoCs and 23 retractions) and control RCTs from the same journal and year. Monthly citation rates up to 60 months before and after editorial ...

10
Integrating stakeholder perspectives in modeling routine data for therapeutic decision-making
2026-02-18 epidemiology 10.64898/2026.02.18.26346074
Top 0.4% (1.9%)
Show abstract

BackgroundRoutinely collected health data are increasingly used to generate real-world evidence for therapeutic decision-making. Yet, stakeholders, including clinicians, pharmaceutical industry representatives, patient advocacy groups, and statisticians, prioritize different aspects of data quality, analysis, and interpretation. Without explicit consideration of these perspectives, analyses risk being fragmented, misaligned with end-user needs, or lacking transparency. MethodsWe developed a sta...

11
Standardisation of terminology, calculation and reporting for assigning exposure duration to drug utilisation records from healthcare data sources: the CreateDoT framework
2026-02-19 epidemiology 10.64898/2026.02.18.26346576
Top 0.5% (1.5%)
Show abstract

BackgroundIn pharmacoepidemiological studies, days of treatment (DoT) duration associated with individual electronic drug utilization records (DUR) are usually missing. Researcher-defined duration (RDD) calculation approaches, as opposed to data-driven approaches, can be used to estimate DoT based on the specific choices and assumptions made by investigators. These are usually underreported or even undocumented. We aimed to develop a framework for the standardization of terminology, formulas, im...

12
Randomized controlled trials claiming "personalized", "individualized" and "precision" interventions: characteristics, transparency and bias
2026-02-12 medical education 10.64898/2026.02.09.26345904
Top 0.6% (1.2%)
Show abstract

The terms personalized, individualized and precision medicine are increasingly used to describe health interventions, yet their operational meaning in clinical research remains unclear. Despite extensive conceptual discussion, there is limited empirical evidence on how these labels are applied in randomized controlled trials (RCTs) and whether such trials meet standards of transparency and methodological rigor. We systematically examined 262 RCTs published between 2020 and 2022 that used the ter...

13
Can AI Match Human Experts? Evaluating LLM-Generated Feedback on Resident Scholarly Projects
2026-03-04 medical education 10.64898/2026.03.04.26346878
Top 0.7% (0.8%)
Show abstract

BackgroundDelivering timely, high-quality feedback on resident scholarly projects is labour-intensive, especially in large programmes. We developed an AI-assisted evaluation system, powered by the open-weight LLaMA-3.1 large-language model (LLM), to generate formative feedback on Family Medicine residents scholarly projects and compared its performance with expert human evaluators. MethodsWe evaluated whether the AI-generated feedback achieves comparable quality to expert feedback. The tool ing...

14
Large Language Models Readability Classification: A Variability Analysis of Sources and Metrics
2026-03-02 public and global health 10.64898/2026.02.20.26346638
Top 0.8% (0.7%)
Show abstract

AbstractAccurate health information is ineffective if patients cannot understand it. Large Language Model (LLM) health research values veridical precision; however, linguistic accessibility remains an under-examined component of output quality and usability. This study investigated two sources of variability in readability classification: differences across LLM systems and across readability metrics. The analysis tested 1,120 data points from seven systems in English and Portuguese, comparing ba...

15
Secondary Prevention of Cardiovascular Events in Patients with Overweight/Obesity in Routine Clinical Practice
2026-02-20 epidemiology 10.64898/2026.02.18.26346594
Top 0.9% (0.7%)
Show abstract

Background and AimsThe glucagon-like peptide-1 receptor agonist (GLP-1 RA) semaglutide has demonstrated efficacy for the secondary prevention of cardiovascular disease among patients with overweight/obesity without diabetes mellitus. However, the comparative effectiveness of GLP-1 RA versus other antiobesity medications (e.g. phentermine-topiramate) not been evaluated. MethodsThis was a retrospective, observational, cohort study using target trial emulation methodology using the Truveta electro...

16
Comparing Existing Algorithms for Retrieving Pregnancy-related Adverse Event Reports
2026-02-18 public and global health 10.64898/2026.02.17.26346457
Top 0.9% (0.7%)
Show abstract

BackgroundPost-marketing surveillance is essential for complementing the safety profiles of medicinal products, especially for populations generally excluded from clinical trials such as pregnant individuals. However, the absence of a standardised pregnancy indicator in the electronic transmissions of adverse event reports hampers their correct identification in pharmacovigilance databases and complicates the study of safety concerns related to pregnancy exposures. Three recently developed rule-...

17
Student Scholarly Research Programs in US Medical Schools: Cross-sectional Web Audit
2026-03-04 medical education 10.64898/2026.03.03.26347497
Top 1.0% (0.5%)
Show abstract

BackgroundParticipating in research during medical school is supported by institutional programs and may influence subsequent professional development. ObjectiveWe aimed to describe the current status and heterogeneity of scholarly research programs for medical students in the United States, including expectations, support, and key structural features. MethodsWe conducted a cross-sectional web audit of official webpages for all accredited US MD- and DO-granting medical schools (search performe...

18
Enhancing competency in clinical trials management: Findings from a multicountry trial coordinators interventional training program
2026-03-04 medical education 10.64898/2026.03.03.26347517
Top 1.0% (0.5%)
Show abstract

BackgroundClinical research coordinators play a crucial role in ensuring the scientific rigor, regulatory compliance, and operational integrity of clinical trials. However, in Africa, they often lack access to structured, competency-based training, especially in operational, regulatory, and trial management domains. This study evaluated the effectiveness of a comprehensive training intervention designed to standardize and enhance core competencies of clinical trial coordinators. MethodsWe condu...

19
Historical Perspectives in Medicine using a Large Language Model: Emulating an 18th Century Physician
2026-02-12 medical education 10.64898/2026.02.10.26345990
Top 1% (0.5%)
Show abstract

IntroductionEighteenth-century medical texts document a formative period in the evolution of clinical reasoning, yet their integration into modern medical education is limited. The traditional approach to learning the history of medicine has naturally focused on passive reading, but new approaches using AI could enable learners to interrogate and simulate the historical diagnostic logic and therapeutic paradigms. More specifically, large language models (LLMs) offer an opportunity to create inte...

20
You cant manage what you cant imagine: The Digital Health Checklist-Risk Management (DHC-RM) Tool to enhance participant protections in digital health research
2026-02-24 health policy 10.64898/2026.02.22.26346854
Top 1% (0.5%)
Show abstract

Digital health technologies are powerful-enhancing data collection, participant engagement, and personalized health interventions-yet their rapid proliferation has outpaced guidance for research participant protection. Current practice assists researchers in identifying risks but provides limited support for comprehensive risk management. To address this gap, we developed the Digital Health Checklist-Risk Management (DHC-RM) Tool, which integrates the established Digital Health Checklist with ap...